Copyright © 1972 American Telephone and Telegraph Company 

The Bell System Technical Journal 

Vol. SI, No. 9, Novemher, 1972 

Printed in U.S.A. 



On Fast Start-Up Data Communication 

Systems Using Pseudo-Random 

Training Sequences 

By R. W. CHANG and E. Y. HO 

(Manuscript received June 21, 1972) 

This paper analyzes the start-up performance of automatic transversal 
equalizers when maximum-length pseudo-random sequences oj short 
periods are selected as the training signals for fast start-up purposes. 
Single-sideband Nyquist systems are considered because they represent 
the limiting case of vestigial-sideband systems with small excess band- 
width. It is shown that the equalizer is capable of fast start-up except 
in some rare situations which can be avoided by using proper timing, 
phase, and equalizer initial settings. The results also show that the equal- 
izer tap convergence rate is independent of the phase characteristic of the 
communication channel and of the choice of the pseudo-random sequences 
which have the same period. 

The equalizer is set up in the training period by minimizing the mean- 
square error between the equalizer output and the transmitted pseudo- 
random sequence, which is different from the mean-square error for random 
data. Surprisingly, ice have found that, even for pseudo-random sequences 
of very short periods, this start-up algorithm results in only a slight deg- 
radation in system performance. Accordingly, good system performance 
can be expected immediately after the system switches from the training 
mode to the data mode. 

I. INTRODUCTION 

Pseudo-random sequences have been used in the past as training 
signals for setting up automatic transversal equalizers during start-up 
periods. 1 ' 2 For fast start-up, it is desirable to know how the equalizer 
settling time depends on the choice of the pseudo-random sequence, 
the channel characteristics, and the initial receiver conditions. These 
problems are examined in the first part of this paper for single-sideband 
Nyquist systems. We present basic theories from which the reader 
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can work out numerical examples of special interest. The results are 
compared with those obtained previously 3 for a different class of training 
signals (isolated test pulses). An important difference between these 
two cases is pointed out. 

When pseudo-random sequences are used, it is most convenient 
to adjust the equalizer tap gains to minimize the mean-square error 
between the equalizer output and the transmitted pseudo-random 
sequence. It is not immediately clear how closely this simple algorithm 
optimizes the data set performance for transmission of random data 
(because an equalizer setting optimum for pseudo-random sequence 
transmission is not necessarily optimum for random data transmission, 
particularly when pseudo-random sequences with very short periods 
are used for fast start-up purpose). This problem is examined in Section 

IV and the analysis is illustrated by examples. 

Section V summarizes the results of this paper. The reader mainly 
interested in the conclusions and their implications may read Section 

V next. 

II. MATHEMATICAL MODEL AND FUNDAMENTALS 

An amplitude modulation data communication system with a con- 
ventional tapped delay line transversal equalizer is depicted in Fig. 1. 
During data transmission, the transmitter transmits the information 
digits, [di], sequentially at time instants t = ••• , t\ — T, i x , 
ti + T, • • ' . The equalizer output is sampled sequentially at the symbol 
rate to recover the information digits. Let the tth equalizer output 
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Fig. 1 — Block diagram of an amplitude-modulation data communication system. 
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sample be ?/, . We adopt the familiar mean-square error (MSE) criterion 
and adjust the gain controls of the equalizer to minimize the MSE 
between y< and d { . We assume {d,-} is an ergodic process, hence the 
mean-square error can be written as 

8 - Urn - y\ (yi - d,) a 

= ((Vi - d,) 2 ) (1) 

where (x) denotes the time average of x. 
It can be seen from Fig. 1 that 

y(t) = 2 C k x(t - (k - l)D, (2a) 

and 



x(t) = E dUk(f - »T), 



(2b) 



where MO is the overall system (without equalizer) impulse response. 
For the sake of simplicity, we shall shift the time origin and use the 
abbreviations y { = y(iT), x { = x(iT), and /*, = h(iT). Thus (1) can 
be written as 



where 



6 = C'AC - 2CV + (<%), 
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i - 1,2, ... ,N, j = 1,2, ••■ ,N, (6) 
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and 



V = 



= 2 (di d m )hi- k+1 -, 



7, 

V 2 



(7) 



fc = l,2, ... ,N. 



(8) 



Let d&/dd be the partial derivative of 8 with respect to C, , i = 1 
to N, and let d8/dC represent an N X 1 column vector the ith element 
of which is dS/dC, . From (3), we obtain 



^ = 2AC-2V. 



(9) 



The optimum value of C that minimizes the MSE S is denoted C opt . 
It is clear from (9) that 

C opt = A- X V. (10) 

Thus, the minimum value of 8 when C = C opt is 

S rain = (d\) - V'A'V. (11) 

Let e denote the difference between C and C opt ; i.e., 

e = C - A-'V. (12) 

Now consider the adjustment of the equalizer. As is well known, 
the equalizer can be adjusted in the training period by transmitting 
either a succession of isolated test pulses or a special class of pseudo- 
random sequence (pseudo-noise sequence 3 ). The case of sending isolated 
test pulses during the training period has been considered by Chang. 4 
In this paper, we examine the case of sending pseudo-noise sequences^ 

In the training period, a binary pseudo-noise sequence is applied 
to the transmitter input. Since an adjustment is made at the end of 
each training sequence period and since we wish to make the largest 
number of adjustments during a fixed training time, we consider the 
shortest possible pseudo-random sequence period (i.e., the case where 

f The best known sequences of this type are the m-sequences (also known as 
maximal-length linear recurring sequences or maximum-length pseudo-random 
sequences). 

* When the period of pseudo-random sequence is shorter than the length of the 
transversed equalizer, the analysis is difficult because A~ l may not exist. 
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the period of the pseudo-noise sequence is equal to the length of the 
transversal equalizer). 

Let the pseudo-noise sequence be denoted by /3i/3 2 /? 3 • • • fj N /3i/3 2 /3 3 ■ • • 
fi k , , where /3*< is the last bit. From (2b), the input to the equalizer 
can be written as 

x(t) = E&A(«- kT). (2b) 

Practically speaking, we may assume that h(t) is time limited. Then 
it can be shown that for t larger than a certain value, say t , x(t) will 
be a periodic function of period NT; i.e., 

x(t) = x(t + NT), l n < t and t + NT ^ k'T. (13) 

In the training period, the values of a,, , V k , 8, C opt , A, and V are 
denoted by a* , V* k , 8*, C* pt , A*, and V*, respectively. From (6), 
we obtain 

aft =EZ (fiMh-i+i-h-i+i-n 

= 2_/ L-l h-i + l-mh-j + l-m+kft 
k m 

~ W E E E /l_ 1 + i_ m /i-y + i- m+ tAr + | all i, j. (6) 

■W k l = \ m 

From (8) 

j JV-1 

= /_, h-k+i + jN t^ 2~t 2-i h-k+i+jN+i , k = 1, 2, • • , N 

j = integers. (8) 

The partial derivative dS*/dC, , i = 1 to N, can be computed from 
each block of N samples of x(t), and the gain control C, is adjusted 
by an amount proportional to d8*/dC, at the end of each block. For 
example, d&*/dd is computed from z,- +I to x itN and C, is adjusted 
after x i+N . Then dS*/dC, is computed from x i+N+l to x i+2 y and C, is 
adjusted after .r.+g.v • The optimum tap setting that minimizes the 
MSE, 6*, is 

C *. = (A*)- X V*. 
We now proceed to examine the convergence of 8*. 
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III. SINGLE-SIDEBAND NYQUIST SYSTEMS 

In this section, we consider single-sideband data communication 
systems which transmit at the Nyquist rate with sin x/x pulses (here- 
after referred to as single-sideband Nyquist system). Such systems 
are considered because they represent the limiting case of sharp rolloff 
vestigial-sideband systems. 

The transfer functions of the transmitting filter, transmission medium, 
and receiving filter (see Fig. 1) are f\(/), F 2 (/) and F 3 (J), respectively. 
The F t {j) are of the following form 

Fi(S) = \F i {1)\e JPiln , i= 1,2,3 (14) 

where J is used to denote the imaginary number V— 1. 

In a single-sideband Nyquist system, | F x {j) | and | F 3 (f) | are specified 

by 



(15) 

and 

I F a (f) | = 1 /, = | / | = U 

(16) 

In general, with lower single-sideband transmission, the carrier fre- 
quency, j e , is set equal to / 2 . Let H(j) denote the Fourier transform 
of h(t), which is the overall system impulse response at the equalizer 
input. It can be shown that 

H(J) - * | ft(f - fc) I 

J[0i(f-fc)+P,(f-fc)+0,(f-f c )-2r(f-fc)ti+9] q < t < , _ i 

= I I F 2 (j + U) | 

J[P,(f + fc)+P,(f + fc)+P,(f + fc)-2T(f + fc)t>-8] _,, _ r •> < i < q 

= otherwise, fl7) 

where 9 represents demodulating carrier phase. The signaling interval is 

Since the time samples h(iT), i = • • • 0, 1, 2, ■ • • are taken at the 
Nyquist rate, we obtain from the sampling theorem, Parseval's theorem, 



F 1 (j) | = 1 


/i = 1 / 1 = /. 


= 


othenvise, 


F a (1) | = 1 


/. = 1 / 1 = u 


= 


otherwise. 
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and (17) 

00 

g(i — j) - £ h-t+i-Ji-i+i-n, 

m =— oo 

- 2(/ a - /i) /" M< - & + f)*(I - jT + T) dt 

= (/, - /O r ' [cos 2*f(i - j)T][\ F 2 (f - /.) |] 2 df. (19) 
Substituting (19) into (6), we obtain 
of* = L 0(* - 3 + MO - Iff £ 2 00 - i + fcAT + all z, j. (20) 



ff * 



1=1 



It is clear from (19) and (20) that a* is independent of the demodulating 
carrier phase 6, the system timing t x , and the phase characteristics 
/?,-(/) of the system. We also note that for a fixed N, a* is independent 
of the choice of the pseudo-noise sequence. [The pseudo-noise sequence 
0i , Pa , ■ • • does not appear in (19) or (20).] Using the method in Ref. 4, 
it is concluded that the equalizer tap convergence rate is independent 
of the demodulating carrier phase 0, the system timing f, , the phase 
characteristics of the system, and the choice of the pseudo-noise se- 
quence for fixed N. 

Note from (19) and (20) that a* depends on the amplitude char- 
acteristic | F 2 (f) | of the transmission medium. Since amplitude dis- 
tortion is not severe in private line systems, in the following discussion 
we assume that 

I F 2 a) \ = i, nzfzn. (2i) 

Substituting (21) into (19), and neglecting a normalizing constant 
(/a — U) 2 , we obtain 

g(i - j) = 1, i = j 

= 0, i 9* j. (22) 

Substituting (22) into (20) gives 

afi = 1, i = j 

= -± i* j- (23) 



Therefore, the eigenvalues of A* are 
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\ k = l+£, k = l,2, ■■• ,N -1 

A* = ^- (24) 

The eigenvector u N corresponding to X^ is an N X 1 vector whose 
elements are all unity. Since all but the last eigenvalue are equal, 
the equalizer can settle rapidly except in the case where the initial 8* 
contains a large component (e' u N ) 2 \ N , where e is the initial tap setting 
error vector 3 . Since X^ is small and since (e' u N ) 2 cannot be exceptionally 
large with proper timing, phase, and initial equalizer settings*, it is 
very unlikely that 8* would contain a large component (ebu N ) 2 \ff . 
Therefore, the equalizer can settle rapidly in the training period. 

IV. FURTHER ANALYSIS OF SYSTEM PERFORMANCE 

At the end of the training period, the equalizer taps are set very 
nearly to C* Bt . The data set is then switched to the data transmission 
mode. Since statistics of the true data differ from those of the training 
pseudo-random sequence, the optimum tap settings, C* pt , obtained 
for a training sequence cannot also be the optimum one for the true 
data. Thus, system performance degradation during the early stage 
of data transmission is expected, even if the data set is equipped with 
an adaptive equalizer. We now proceed to determine this degradation. 

We assume zero-mean independent information digits (binary or 
multilevel). The signal level is normalized such that 

<<£> = 1. (25) 

The mean-square error, 8, can be obtained from (3), 

8 = (C* t )'A(C* - 2(C* t )'V + 1, (26) 

where A and V are given by (5) and (7), respectively. From (6), (8), 
(19), and (25), we obtain 

an = g(i - j), (27) 

and 

V k = h. k+1 . (28) 

The a* and V* can be rewritten as 

aft = a.-, + A U (N), (29) 



* For example, one may use the method described in Ref . 1 . 
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and 

Vf = V k + y k (N), (30) 

where 

AoW = E gd -i + ^)-^Ef gii - j + kN + 0, (31) 
and 

j JV-l 

7*(N) = £ h-k + i + iN - T? £ 2 h- k+1+iN+l . (32) 

From (26), we have 

S = C^ pt AC opt - 2C opt V + 1 + (5C)'A(5C) 

= 8 min + (5C)'A(5C), (33) 

where C opl is the optimum tap setting for the true data and 5C is the 
difference between C* pt and C opt , 

5C = C* , - C &pt . (34) 

The last term in (33) represents the system performance degradation 
and is non-negative. 

The mean-square error during the early stage of data transmission 
can now be determined from (33) and (34). As the period of the training 
PN sequence approaches infinity, lim^*, A tl (N) — > 0, limw^ y k (N) —* 0, 
and lim A r_ 0O Sc — > 0. Hence, C* pt approaches C opt asymptotically as 
N increases. 

We now assume some specific channel characteristics and apply 
these formulas to determine the initial performance degradation. 
Example 1 : A baseband channel with a flat amplitude characteristic 
and a typical quadratic delay characteristic 5 is assumed. The delay 
at the Nyquist frequency is taken to be /3 m 7 7 seconds. The phase char- 
acteristic is of the form 

&(/) = Sw(3 m T 3 f/3. (35) 

The system impulse response, h(t), can be calculated from 

h(t) = 2 f cos {2irjt + /3(/)) df. (36) 

Jo 

In this example, we consider a typical value /3,„ = 2. One hundred one 
samples of h(t) (from t - 50T to t u + oOT) are taken with T = 1 and 
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t = 0.67 7 . For a 7-tap equalizer, the minimum mean-square error 
attainable is 0.03347. The mean-square error obtained from (33) is 
0.0393. The results for a 15-tap equalizer are 0.01484 and 0.01785, 
respectively. It is clear from these numbers that the performance 
degradation caused by using a PN sequence in the training period is 
negligible. This can be further illustrated by sketching the vector 5C 
in (34). Since the amplitude characteristic is constant, we have 



and 
where 
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(37b) 



(37c) 



The inverse of A* is found to be 



N — 1 

(a*)- 1 = i + ^-r4 i ^ 



N 2 



N + 1 * ' N + 1 
From (10), (34), (37a), and (38), we have 



5C = ^i v + 



2N 



Y + 



N 2 



JV + 1 ' AT + 1 ' ' N + 1 
Substituting (28), (32), and (37c) into (39), we obtain 



AV+ A Y }. 



(38) 



(39) 



5C = 



Z-* *iur-i)/»]+ii+<» 



2> 



-UN-l)/2] + to + iN 



(40) 



In Fig. 2, the time samples h k , k = — oe to » , of h(t) are sketched. 
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Fig. 2 — Illustration of the elements of 5C 



For N = 7 (7-tap equalizer), the first element of 8C is the sum of the 
infinite sequence • ■ • , h- n , h- 4 , h 10 , • • • ; the second element of 5C 
is the sum of the infinite sequence • • • , h- l2 , k- B , h 9 , • • • ; etc. It 
can be seen that the large time samples h- 3 to /j 3 are not included in 
these sums. This explains why SC should have small elements. By 
repeating the above for N = 15, one can easily see that the performance 
degradation due to the use of PN training sequence is small and that 
this degradation approaches zero as N increases. 
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Example 2: We continue example 1 but change fi m to its minimum 
value zero. The infinite sum ^,> A-t+t.+w can now De evaluated 
in closed form. By using the sampling theorem or the formulas for 
Psi (Digamma) functions, 6 it is obtained after some manipulations 

Ei __ / -,\-*+jv sin irt 

where ^ t ^ 0.5T is assumed. The performance degradation can 
now be determined in closed form 

W-.U/1 a : n 2_ / 

(5C)'A(5C) - £ W 8 

ft [JV-U/2 « *» 

^_ f ( C ^J^ +tff ^)}' (42) 

S min and (&C)'A(5C) are plotted in Fig. 3 for N = 7, 15, and 31 and 
t = 0.05, 0.1, 0.15, 0.2 and 0.25. It can be seen that the value 
of (5C)'A(5C) is approximately an order of magnitude less than that 
of 8 min . Also note that (5C)'A(8C) reduces almost by half when N is 
doubled. These results again show that the performance degradation 
caused by using PN training sequence is negligible. 

V. CONCLUSIONS AND DISCUSSIONS 

We have analyzed the start-up performance of a transversal equalizer 
for the case where a maximum-length pseudo-random sequence is 
used as the training signal to adjust the equalizer in the training period. 
The equalizer taps are adjusted by the gradient method to minimize 
the mean-square error, 8*, between the equalizer output and the trans- 
mitted pseudo-random sequence. The pseudo-random sequence has 
been denoted fa , j8 2 , • • • , fa , fa , fa , • • • , where JV is the period of 
the sequence. We have considered the case where N is equal to the 
number of taps of the equalizer. The following results are obtained: 

(i) For a fixed N, the initial value of the mean-square error &*, 
the convergence rate of 8*, and the minimum value of 8* are 
all independent of the specific values of the k 's. Therefore, the 
same performance is obtained with any of the many pseudo- 
random sequences available. For example, a maximum-length 
pseudo-random sequence can be cyclic shifted to produce N 
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Fig. 3 — Computational results of Example 2. 

pseudo-random sequences. Any one of these N sequences can 
be used in the training period with the same result. 

(u) The initial value of 8* depends on the phase characteristic of 
the communication channel, and the timing and phase settings 
at the receiver. However, the convergence rate of £* is inde- 
pendent of all these parameters. This result is similar to the 
one obtained previously 4 for the case where isolated test pulses 
are used as the training signal. 

(in) Unlike the isolated test pulse case, the eigenvalues of the cor- 
relation matrix here are not close together. For example, when 
the channel has a flat amplitude characteristic, the first N — 1 
eigenvalues Xj to X^-i are equal to 1 + 1/N, while the iVth 
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eigenvalue X^ is equal to 1/N. The mean-square error can be 
decomposed into N components, each associated with one of 
the N eigenvalues. The tap gain adjustment reduces rapidly 
the N — 1 components associated with Xi to X^-i , but the 
component associated with \ N decreases very slowly. Therefore, 
as discussed at the end of Section III, care must be exercised 
in setting the timing, phase, and equalizer taps at the beginning 
of the training period so that the component associated with X^ 
has a small initial value. Note that this precaution is not re- 
quired when isolated test pulses are used, because in that case 
the eigenvalues are all close together and the components of 
the mean-square error all decrease rapidly. 
(iv) The analysis shows that the tap settings obtained with 
maximum-length pseudo-random sequences with very short 
periods are nearly optimum for random data transmission. 
More specifically, the equalizer taps are adjusted in the training 
period to minimize the mean-square error 8* between the 
equalizer output and the transmitted pseudo-random sequence. 
When such tap settings are used for actual data transmission, 
the mean-square error between the equalizer output and the 
transmitted random data can be written as 

S = S min + e 

where 8 min is the minimum attainable value of 8, and e is non- 
negative, because tap settings obtained with pseudo-random 
sequence do not necessarily minimize 8. Formulas for computing 
e were developed in Section IV and illustrated by numerical 
examples. It can be seen from these formulas and Figs. 2 and 3 
that e decreases rapidly as N increases (N is the period of the 
pseudo-random sequence and also the number of equalizer 
taps). The computations show that e decreases approximately 
by the factor 1/N (for example, e reduces approximately by 
half when N is doubled). The computations also show that e 
is about an order of magnitude less than 8 min . (This is so for 
N as small as seven.) Therefore, tap settings obtained with 
pseudo-random sequences are nearly optimum for actual data 
transmission. 
(v) As can be seen from the computations in Section IV, 8 min is 
rather large when N is small. For example, for a system with 
typical channel delay distortion (see example 1) and S/N = 
30 dB, 8 min can be 15 dB above the thermal noise level when 
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N = 7, and 12 dB above the thermal noise level when N = 15. 
These large mean-square errors are due to the fact that for 
single-sideband Nyquist systems the overall system impulse 
response decays very slowly with time. Thus, for very sharp 
rolloff VSB systems (such as 4-percent rolloff), it is necessary 
to use a large number of equalizer taps (such as 31 or more). 
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